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Abstract: We examined the neural activation to consonant-vowel transitions by cortical auditory evoked potentials 
(AEPs). The aim was to show whether cortical response patterns to speech stimuli contain components due to one of the 
temporal features, the voice-onset time (VOT). In seven normal-hearing adults, the cortical responses to four different 
monosyllabic words were opposed to the cortical responses to noise stimuli with the same temporal envelope as the 
speech stimuli. Significant hemispheric asymmetries were found for speech but not in noise evoked potentials. The differ- 
ence signals between the AEPs to speech and corresponding noise stimuli revealed a significant negative component, 
which correlated with the VOT. The hemispheric asymmetries can be referred to rapid spectral changes. The correlation 
with the VOT indicates that the significant component in the difference signal reflects the perception of the acoustic 
change within the consonant-vowel transition. Thus, at the level of automatic processing, the characteristics of speech 
evoked potentials appear to be determined primarily by temporal aspects of the eliciting stimuli. 
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1. INTRODUCTION 

During the auditory perception of speech, the central 
hearing has to process complex acoustic structures in real- 
time. Adequate processing of acoustic information is the 
requirement for speech and language development [1]. Par- 
ticularly, disorders in the temporal processing are supposed 
to be responsible for deficient auditory discrimination abili- 
ties, which can result in an impaired speech perception [2]. 
Cortical auditory evoked potentials (AEPs) provide an objec- 
tive method with high temporal resolution for the investiga- 
tion of auditory speech processing. 

In the absence of attention, AEPs represent the automatic 
cortical processing on auditory sensory level. In this case, 
AEPs are dominated by obligatory components. In adults, 
these components are particularly the Nl and P2, also called 
N1/P2 complex [3]. The N1/P2 complex is thought to reflect 
the synchronous neural activity of structures in the fhalamic- 
cortical segment of the central auditory system. The Nl, a 
negative peak occurring approximately 100 ms after stimulus 
onset, is suggested to represent sound detection functions 
[4], since it is sensitive to onset sound features, such as in- 
tensity and interstimulus interval. For example, it could be 
demonstrated that the Nl indicates segmentation during the 
perception of continuous speech [5]. P2, a positive peak oc- 
curring approximately 200 ms after stimulus onset, is as- 
sumed to reflect sound content properties like acoustic or 
phonetic structures [6]. 
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Several investigators have examined the possibility of 
using AEPs to determine the individual discrimination abil- 
ity of phonetic structures in speech sounds [7-10]. Of par- 
ticular interest are stimulus features constituting minimal 
pairs, such as formant gradients of consonant vocal- 
transitions and the voice onset-time (VOT). VOT is defined 
as the interval between the release from stop closure and the 
onset of laryngeal pulsing [11]. Sharma and colleagues 
(1999) investigated the morphology of speech evoked poten- 
tials in absence of attention depending on the subjective per- 
ception of stop consonants. Analyzing the cortical responses 
to a /da/-/ta/ continuum they found that stimuli with short 
VOTs (0-30 ms) evoked a single Nl, whereas stimuli with 
long VOTs (50-80 ms) evoked two distinct negative compo- 
nents (Nl' and Nl). This discontinuity in AEP morphology 
was in line with the subjective perception of voiceless 
sounds and thus, it was suggested to represent an electro- 
physiological correlate of categorical perception [7]. How- 
ever, in another study using a /ga/-/ka/ continuum as stimuli, 
the change in Nl morphology to a double-peaked component 
did not signal subjective perception of a voiceless sound. 
This result indicated that the minimum VOT value of 40 ms 
required for the temporal separation of Nl depends on 
acoustic properties of the stimulus rather than the perceptual 
categorization [8]. 

Martin et al. investigated cortical responses to syllables 
with different formant transitions embedded in masking 
noise [12]. The stimuli evoked an Nl which systematically 
changed with the stimulus energy (which is a sound onset 
feature), but not with the individual subjective discrimina- 
tion. 

Ostroff et al. compared the cortical response to the sylla- 
ble /sei/ with the cortical responses to the sibilant I si and to 
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the vowel /ei/. They ascertained that the response to /sei/ was 
a combination of the AEPs to the onsets of the two constitu- 
ent phonemes /s/ and /ei/ [13]. These overlapping AEPs 
within one stimulus response have been termed the acoustic 
change complex (ACC) [14]. The ACC is supposed to be 
composed of different N1/P2 complexes reflecting the 
acoustic changes across the entire stimulus [15]. 

Overall, the aforementioned studies indicate that speech 
evoked potentials primarily reflect the phonetic composition 
of the stimuli. The aim of the presented study was to investi- 
gate whether cortical responses to speech stimuli contain 
AEP components referring to the VOT. For this purpose, 
AEPs to four natural monosyllabic words were compared 
with AEPs to noise stimuli showing the same temporal enve- 
lope as the word stimuli [16]. For a systematic analysis of 
the cortical responses, the speech stimuli were chosen with 
respect to varying VOTs. 

2. MATERIALS AND METHODOLOGY 

2.1. Participants 

We examined 7 normal-hearing, right-handed monolin- 
gual native German speakers (3 female, 4 male). The age 
range was 22 to 27 years. None was on medication at that 
time. All participants signed an informed consent. 

2.2. Stimuli and Procedure 

In this study, two types of stimuli were used to elicit 
electrophysiologic responses: speech stimuli and noise sound 
stimuli. The speech stimuli consisted of four monosyllabic 
words naturally produced by a male speaker, taken from 
Freiburger speech discrimination test [17]. The stimuli were 
Ei /a:i/, Bett /bet/, Dieb /di:b/, and Pult /pult/, which mean 
egg, bed, thief and desk. The durations of the stimuli Ei, 
Bett, Dieb, and Pult were 544 ms, 430 ms, 501 ms, and 567 
ms, respectively. The speech stimuli exhibited a bandwidth 
of 8 kHz and were digitalized with 14 bit at a sampling rate 
of 20,000 s" . Except Ei, all speech stimuli started with an 
initial stop consonant and were chosen with respect to their 
varying VOT. The VOTs were determined as VOT(£7) = 0 
ms, VOT(Bett) = 35 ms, VOT(Dieb) = 60 ms, and 
VOT(Pult) = 80 ms. The onset times of the closing conso- 
nants which succeed the vowels were determined as 215 ms 
with Bett, 390 ms with Dieb, and 395 ms with Pult. 



The speech stimuli served as raw material for the synthe- 
sis of the noise stimuli. Noise stimuli were created by ran- 
domly multiplying the discrete samples of the speech stimuli 
with ± 1 and subsequent 8 kHz low pass filtering. Fig. (1) 
shows exemplarily the generation of the noise stimulus cor- 
responding to Dieb. Thus, any phonetic information was 
removed from the speech stimuli, whereas the intensity was 
held constant. By this means, the synthesized noise stimuli 
Ei N , Bett N , Dieb N , and Pult N v/ere generated. 

Spectrograms of all presented stimuli are depicted in Fig. 
(2). The speech stimuli mainly consisted of frequencies 
lower than 2 kHz, whereas the noise stimuli were distributed 
over the entire bandwidth of 8 kHz. 

For equated presentation loudness, the applied stimuli 
were calibrated with respect to their root mean squares to a 1 
kHz reference tone with an intensity level of 70 dB SPL (see 
[18]). Stimuli were presented in four blocks of 320 sounds 
each. Within the block, all stimuli were successively pre- 
sented 40 times in the order of speech and corresponding 
noise stimulus (speech-noise pairs). Thus, each stimulus was 
presented 160 times. The interstimulus interval varied ran- 
domly between 1400 and 2100 ms. Subjects were tested in a 
soundproof booth. In order to minimize effects of the sub- 
jects' state of attention on the applied stimuli, they were 
asked to watch a silent movie presented on a TV screen. 

2.3. EEG Recording and Data Processing 

The EEG was derived with Ag/AgCl electrodes, which 
were integrated in a cap (Braincap, Brain Products GmbH, 
Gilching, Germany) with 30 fixed electrode positions (Fpl, 
Fp2, F3, F4, C3, C4, P3, P4, 01, 02, F7, F8, T3, T4, P7, P8, 
Fz, FCz, Cz, Pz, FC1, FC2, CP1, CP2, FC5, FC6, CP5, CP6, 
TP9, TP 10). The electrode impedances were kept below 5 
kOhm. For eye artefact rejection, an electro-oculogram was 
recorded by an electrode placed under the right eye. The 
EEG was recorded with Brain Amp-MR (Brain Products 
GmbH, Gilching, Germany) at a sampling rate of 500 Hz and 
digitalization of 16 bit. 

During data acquisition, all channels were referenced to 
FCz. Offline, data were re-referenced to the mean of TP9 
and TP 10. The EEG was 0.13 Hz - 20 Hz band pass filtered 
with a slope of 12 dB/octave. The recording window in- 
cluded 100 ms prestimulus and 700 ms poststimulus time. 



Speech stimulus: Corresponding noise 

Dieb stimulus: Dieb N 
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Fig. (1). Speech stimulus Dieb and synthesized noise stimulus DiebN- Both stimuli exhibit the same temporal envelope but phonetic informa- 
tion is discarded in the noise stimulus. 
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Fig. (2). Spectral view of the presented stimuli. Top: Natural spoken monosyllabic words. Bottom: Synthetic noise sounds, derived from the 
monosyllabic words. Bandwidth of the stimuli is 8 kHz. 



Sweeps with artefacts measuring higher than 75 uV were 
rejected. The remaining sweeps were averaged separately for 
each stimulus and prestimulus baselined. 

For each stimulus, grand mean waveforms were com- 
posed and grand mean latencies of the AEP components 
were visually identified. The individual peak latencies were 
determined in a time interval of the central latencies ± 24 ms. 
Amplitudes were calculated as a mean voltage at the 20 ms 
period centered at the individual peak latencies. 

2.4. Data Analysis 

In the first step, statistical comparative tests of the AEP 
characteristics were carried out for both stimulus types, i.e. 
speech and noise stimuli, separately. In the second step, cor- 
tical responses to speech and corresponding noise sounds 
were analysed and compared among each other. 

2.4.1. Responses to Speech and Noise Stimuli 

Preliminary one-way analyses of variance (ANOVA) 
were conducted to determine the scalp distribution of the 
amplitudes of Nl and P2. Since the largest responses for all 
stimuli were obtained at central positions, channel Cz was 
chosen for following one-way ANOVAs, which examined 
Nl and P2 latencies and amplitudes of the distinct speech 
and noise elicited AEPs. Newman-Keuls tests were carried 
out as post hoc analyses for the determination of equal-mean 
subsets. For the examination of hemispheric differences, Nl- 
P2 interpeak amplitudes of the lateral scalp sites (T3, FC5, 
C3 vs. T4, FC6, C4) were compared via paired t-tests. 

2.4.2. Speech-Noise Pairs 

For each subject time intervals were determined, in 
which speech and corresponding noise responses statistically 
differed. For this purpose, the sampling values of the single 
recorded sweeps in response to each stimulus were pooled 
for every discrete time step independently, resulting in 350 



samples of speech and noise responses each (700 ms post- 
stimulus interval x 500 s" 1 EEG sampling rate). Speech re- 
sponse samples of every time step were compared to the cor- 
responding noise response sample by Student's two-tailed t- 
test. As level of significance p<0.01 was selected. In order to 
avoid Bonferroni problems due to the large number of com- 
pared samples, a statistical difference between speech and 
noise response was accepted, when at least 12 consecutive 
sampling points (i.e., 24 ms interval) showed significance 
[19]. 

On average level, difference signals were created by sub- 
tracting the noise elicited AEPs from the corresponding 
speech elicited AEPs. Latencies and amplitudes of occurring 
components in the difference signals were determined visu- 
ally. Amplitudes were tested for significant appearance via 
one-tailed t-test. 

3. RESULTS 

3.1. Responses to Speech and Noise Stimuli 

Preliminary one-way ANOVAs, which separately ana- 
lysed Nl and P2 peak amplitudes, revealed significant am- 
plitude differences between the electrode sites for both, 
speech and noise stimuli (p < 0.001, F 2 6,i62 = 3.3 - 8.6). 
Newman-Keuls post hoc analysis revealed that with all stim- 
uli, the greatest amplitudes were elicited in the central area 
of the scalp (i.e., channels Cz3, Cz, Cz4) for both, Nl and P2 
amplitude. Hence, the following illustrations and statistics 
are restricted to channel Cz. Fig. (3a) shows the grand mean 
waveforms of the cortical responses to the four different 
speech stimuli. Morphologies of the speech-evoked poten- 
tials differed clearly across the stimuli. AEPs in response to 
Bett and Pult appear to reveal multiple overlapping response 
patterns. In all subjects, the predominant components were 
Nl and P2 in all responses. All speech stimuli elicited an Nl 
component which peaked around 113 + 8 ms, and a P2 com- 
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Fig. (3). (a) Grand mean waveforms of speech stimuli elicited AEPs in response to Ei, Bett, Dieb, and Pult at channel Cz. (b) Grand mean 
waveforms of noise-sound elicited AEPs in response to Ei N , Bett N , DiebN, and Pult N at channel Cz. 



portent that peaked in a range from 180 to 260 ms around 
207 ± 36 ms. 

Fig. (3b) shows the grand mean waveforms of the corti- 
cal responses to the four different noise stimuli. The wave- 
forms of the different noise sound elicited AEPs were simi- 
lar. The predominant components in all responses were Nl 
and P2. All noise stimuli elicited an Nl component which 
peaked around 103 ± 8 ms, and a P2 component that peaked 
around 175 ± 11 ms. 

In Tables 1 and 2, Nl and P2 latencies and amplitudes of 
both stimulus types are given. Table 3 shows F values of 



one-way ANOVAs, conducted to reveal significant differ- 
ences of these characteristics. Newman-Keuls post hoc tests 
did not show consistent correlations within a subset of both 
stimulus types. As it is apparent from the higher levels of 
significance, speech evoked AEP characteristics diverged 
stronger among each other than those evoked by noise 
sounds. 

For the analysis of hemispheric asymmetries, a paired t- 
test was conducted with N1-P2 interpeak amplitudes of op- 
posite channel pairs T3 - T4, FC5 - FC6, and C3 - C4, con- 
tinuing from lateral to central. The mean N1-P2 interpeak 



Table 1. Mean Latencies of Nl and P2 in ms Elicited by the Speech/Noise Stimuli at Channel Cz 



Peak 


Stimulus Type 


Ei 


Bett 


Dieb 


Pult 


Nl 


Speech 


106(4) 


119(8) 


120 (5) 


107(7) 




Noise 


95 (8) 


111(7) 


108(7) 


97(7) 


P2 


Speech 


183 (12) 


181 (8) 


206(12) 


259(10) 




Noise 


177(15) 


181 (16) 


185 (10) 


160(12) 



Standard deviations are in parentheses. 
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Table 2. Mean Amplitudes of Nl and P2 in uV Elicited by the Speech/Noise Stimuli at Channel Cz 



Peak 


Stimulus Type 


Ei 


Bett 


Dieb 


Pult 


Nl 


Speech 


-3.82(1.29) 


-2.36(1.04) 


-4.45 (1.54) 


-2.36(1.36) 




Noise 


-3.01 (1.58) 


-2.28 (1.48) 


-2.58 (1.33) 


-2.03 (1.04) 


P2 


Speech 


4.00(1.29) 


2.75 (1.18) 


4.40(1.50) 


1.99 (0.57) 




Noise 


3.22(1.01) 


2.35 (0.85) 


4.11 (1.46) 


2.49 (0.90) 



Standard deviations are in parentheses. 

Table 3. ANOVAs F Values of the Different AEP Characteristics 





F value, Nl 


F value, P2 


Latency 


Amplitude 


Latency 


Amplitude 


Speech stimuli 


9 o,*** 


4.5* 


§4 9*** 


5 9** 


Noise stimuli 


8.4** 


0.7" 


4 7** 


3.9* 



" ! not significant, *p<0.05, **p<0.01, ***p<0.001 

Table 4. N1-P2 Interpeak Amplitudes (in uV) and t Values for the Comparison of Selected Electrode Pairs (T3-T4, FC5-FC6, C3- 
C4) 





T3 


T4 


t 


FC5 


FC6 


t 


C3 


C4 


t 


Speech 


2.29(1.18) 


3.23(1.15) 


_5 7*** 


4.80(1.13) 


4.80(1.11) 


0.0" 


5.19(1.35) 


4.98(1.26) 


29** 


Noise 


2.15(1.14) 


2.24(1.29) 


-0.5" 


3.89(1.09) 


3.83(1.20) 


0.7" 


3.98(1.21) 


3.95(1.32) 


0.5 " 



Standard deviations are in parentheses. " s not significant, *p<0.05, **p<0.01, ***p<0.001. 

amplitude of T3 was significantly smaller than that of T4 for 
the speech stimuli but not for the noise stimuli, see t values 
in Table 4. While N1-P2 interpeak amplitudes at the posi- 
tions FC5 and FC6 did not differ, an inversion from right to 
left hemispheric predominance occurred at the central posi- 
tions C3, C4 for the speech stimuli (C3 > C4, see Table 4). 
Exemplarily, cortical responses to Dieb and Dieb N across the 
entire scalp are depicted in Fig. (4), in order to illustrate the 
hemispheric asymmetries of the speech evoked potentials. 

3.2. Speech-Noise Pairs 

The AEP waveforms, in response to the speech sounds, 
were compared to those in response to the corresponding 
noise sounds. The time intervals, in which speech and corre- 
sponding noise responses significantly differed, are depicted 
in Fig. (5). Blocks of black colour mark temporal areas of 
significant differences (p<0.01) for each subject and each 
speech-noise pair. None of the speech-noise pair responses 
differs strongly during the initial 50 ms, as one can see from 
Figs. (5a-5d). The highest accumulations of areas of signifi- 
cant differences occur in the time window of 100 - 300 ms 
after stimulus onset. 

Fig. (6) shows the grand mean AEP waveforms of the 
speech (dashed lines) and noise stimuli (dotted lines) as well 
as the difference signals (solid lines) speech - noise. In the 
difference signal, a negative component could be observed in 
a latency range of approximately 140 + 15 ms. For each 
speech-noise pair, one-tailed t-tests (subjects x channels) 
showed that the occurring negative component was signifi- 
cantly smaller than zero. Latencies and amplitudes of this 
negative component are given in Table 5. Ei elicits the 
smallest latency, latencies in response to Bett and Dieb are 



similar and Pult elicits the longest latency. Fig. (7) shows the 
VOT of the four speech stimuli plotted against the peak la- 
tency of the negative component in the difference signal. A 
prominent correlation of VOT and peak latency is observ- 
able. Spearman rank correlation analysis revealed that the 
latencies were significant positively correlated with VOT (r 
= 0.66, p< 0.01). 

4. DISCUSSION 

Generally, different natural speech stimuli evoke distinct 
neural response patterns [20]. The aim of this study was to 
show whether cortical response patterns to speech stimuli 
contain components due to one of the temporal features, the 
voice-onset time. For this purpose, AEP characteristics in 
response to monosyllabic words were compared against each 
other and with cortical responses to noise stimuli with the 
same temporal envelope as the speech stimuli. 

4.1. Responses to Speech and Noise Stimuli 

In accordance with other studies (e.g. [6, 20, 21]), for 
both types of stimuli, the AEPs were maximal at central 
electrode sites. The different natural speech stimuli evoked 
distinct neural response patterns, partially showing multiple 
overlapping responses. These distinctions indicate an ex- 
tended neural activity with the auditory processing of speech 
in contrast to noise sound processing. Considering previous 
studies, a plausible explanation are the variant VOTs of the 
presented speech stimuli which affect the Nl and P2 [8, 13, 
22,23]. 

Winkler et al. demonstrated that Nl latencies evoked by 
spectral-pitch and missing-fundamental tones do not differ 
[24]. Hence, the different pitches of the presented vowels are 
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Fig. (4). Grand mean AEPs to Dieb (solid) and DiebN (dashed) over the entire scalp. 
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Fig. (5). Temporal areas of significant differences between the re- 
sponses to speech and noise stimuli at channel Cz for each subject 
(black areas). Level of significance is p<0.01. 



probably not the reason for the found variability in the 
speech evoked potentials, at least not regarding the Nl. An 
influence of the cortical response to the closing consonant on 
the P2 is not expectable due to its relatively late onset time. 

4.2. Hemispheric Asymmetries 

In contrast to noise elicited AEPs, hemispheric differ- 
ences occurred with speech evoked AEPs in the present 
study. Hemispheric left-overbalanced asymmetries with 
speech perception are well known [25-27]. Left-overbalance 
is primarily reported in studies with attentive designs. How- 
ever, focused auditory attention can selectively modulate 
automatic processing in auditory cortex and thus, affect 
AEPs [28]. 

Dehaene-Lambertz et al. demonstrated in a discrimina- 
tion task that the left-hemispheric predominance with pho- 
neme perception also can be achieved with appropriate non- 
speech stimuli [29]. Thus, specialization of left auditory cor- 
tex is not speech specific but depends on rapid spectro- 
temporal changes. However, in the present study, a right- 
hemispheric predominance was observable at temporal sites 
which inverted to a left-overbalance at central sites. This 
could be due to deviant locations in the activated auditory 
areas of both sides, resulting in different dipole orientations 
which underlie AEP derivations [30]. A more plausible 
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Fig. (6). Grand mean waveforms at channel Cz of speech evoked responses (dashed), noise evoked responses (dotted) and the difference sig- 
nal (solid). 

Table 5. Mean Latencies and Amplitudes of the Negative Component in the Difference Signal out of the Speech-Noise Pairs Aver- 
aged over all Channels 



Stimulus Type 


Ei 


Bett 


Dieb 


Pult 


Latency (ms) 
Amplitude (uV) 


123 (12) 
-1.36(1.08)*** 


138(12) 
-0.78 (1.12)*** 


142(11) 
-2.64(1.44)*** 


160(17) 
-1.81 (1.11)*** 



Standard deviations are in parentheses. Results of one-tailed / tests ***p<0.001. 



explanation is a functional asymmetry of the auditory corti- 
ces. PET findings indicated complementary specializations 
of left and right auditory cortex' belt areas [31]. Responses 
to temporal features were weighted towards the left, while 
responses to spectral features were weighted towards the 
right hemisphere. 

Thus, the spectral changes of the consonant-vowel transi- 
tions may be the reason for the observed right hemispheric 
overbalance. In contrast, noise stimuli did not evoke asym- 
metric responses. 



4.3. Speech-Noise Pairs 

Although phonetic changes were present in the speech 
stimuli during the entire propagation of approximately 500 
ms, AEPs of speech-noise pairs differed predominantly in 
the time window of the N1/P2 complex between 100 and 300 
ms (Fig. 5), reflecting distinct onset processing. 

Subtracting noise from speech elicited AEPs facilitated a 
direct comparison of the speech-noise pairs. Thus, the pre- 
sented paradigm allows observing the neural processing of 
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Fig. (7). Box and whisker plot of VOTs of the diverse speech stim- 
uli against the latencies of the negative components in the differ- 
ence signal of the cortical responses. Dashed lines within the boxes 
display the median, the edges of the boxes display the quartiles. 
Whiskers indicate maximal and minimal latencies unless outliers 
'+' occur (whiskers span 1.5 times the inter-quartile range). 

spectral-acoustic and phonetic information, since differences 
in stimulus duration and amplitude are eliminated. The com- 
parison revealed a significant negative component in the 
difference signal that seems to correlate with the VOT of the 
presented stimulus (see Steinschneider et al. who found that 
synchronized activity in the auditory cortex is time locked 
with consonant release and voicing onset [32]). Since the 
speech evoked Nl and P2 appear somewhat later and height- 
ened in contrast to the noise evoked, the present study sug- 
gests that the observed negative component denotes a second 
N1/P2 complex which is merged with the onset response. 
Therefore it is not directly visible. This second N1/P2 com- 
plex is suggested to reflect the onset of the vowel as an 
acoustic event. 

This study demonstrated that rapid spectral changes in 
the perception of speech can be assessed electrophysiologi- 
cally by means of a time-saving design. In case of modified 
cortical responses in subjects with central auditory process- 
ing disorders, this paradigm might serve as diagnostic tool 
for phonetic discrimination tasks since it reveals the neural 
processing of time-critical speech structures. 
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